56 research outputs found

    Visual-Semantic Learning

    Get PDF
    Visual-semantic learning is an attractive and challenging research direction aiming to understand complex semantics of heterogeneous data from two domains, i.e., visual signals (i.e., images and videos) and natural language (i.e., captions and questions). It requires memorizing the rich information in a single modality and a joint comprehension of multiple modalities. Artificial intelligence (AI) systems with human-level intelligence are claimed to learn like humans, such as efficiently leveraging brain memory for better comprehension, rationally incorporating common-sense knowledge into reasoning, quickly gaining in-depth understanding given a few samples, and analyzing relationships among abundant and informative events. However, these intelligence capacities are effortless for humans but challenging for machines. To bridge the discrepancy between human-level intelligence and present-day visual-semantic learning, we start from its basic understanding ability by studying the visual question answering (e.g., Image-QA and Video-QA) tasks from the perspectives of memory augmentation and common-sense knowledge incorporation. Furthermore, we stretch it to a more challenging situation with limited and partially unlabeled training data (i.e., Few-shot Visual-Semantic Learning) to imitate the fast learning ability of humans. Finally, to further enhance visual-semantic performance in natural videos with numerous spatio-temporal dynamics, we investigate exploiting event-correlated information for a comprehensive understanding of cross-modal semantics. To study the essential visual-semantic understanding ability of the human brain with memory, we first propose a novel Memory Augmented Deep Recurrent Neural Network (i.e., MA-DRNN) model for Video-QA, which features a new method for encoding videos and questions, and memory augmentation using the emerging Differentiable Neural Computer (i.e., DNC). Specifically, we encode semantic (i.e., questions) information before visual (i.e., videos) information, which leads to better visual-semantic representations. Moreover, we leverage Differentiable Neural Computer (with external memory) to store and retrieve valuable information in questions and videos and model the long-term visual-semantic dependency. In addition to basic understanding, to tackle visual-semantic reasoning that requires external knowledge beyond visible contents (e.g., KB-Image-QA), we propose a novel framework that endows the model with capabilities of answering more general questions and achieves better exploitation of external knowledge through generating Multiple Clues for Reasoning with Memory Neural Networks (i.e., MCR-MemNN). Specifically, a well-defined detector is adopted to predict image-question-related relation phrases, each delivering two complementary clues to retrieve the supporting facts from an external knowledge base (i.e., KB). These facts are encoded into a continuous embedding space using a content-addressable memory. Afterward, mutual interactions between visual-semantic representation and the supporting facts stored in memory are captured to distill the most relevant information in three modalities (i.e., image, question, and KB). Finally, the optimal answer is predicted by choosing the supporting fact with the highest score. Furthermore, to enable a fast, in-depth understanding given a small number of samples, especially with heterogeneity in the multi-modal scenarios such as image question answering (i.e., Image-QA) and image captioning (i.e., IC), we study the few-shot visual-semantic learning and present the Hierarchical Graph ATtention Network (i.e., HGAT). This two-stage network models the intra- and inter-modal relationships with limited image-text samples. The main contributions of HGAT can be summarized as follows: 1) it sheds light on tackling few-shot multi-modal learning problems, which focuses primarily, but not exclusively, on visual and semantic modalities, through better exploitation of the intra-relationship of each modality and an attention-based co-learning framework between modalities using a hierarchical graph-based architecture; 2) it achieves superior performance on both visual question answering and image captioning in the few-shot setting; 3) it can be easily extended to the semi-supervised setting where image-text samples are partially unlabeled. Although various attention mechanisms have been utilized to manage contextualized representations by modeling intra- and inter-modal relationships of the two modalities, one limitation of the predominant visual-semantic methods is the lack of reasoning with event correlation, sensing, and analyzing relationships among abundant and informative events contained in the video. To this end, we introduce the dense caption modality as a new auxiliary and distill event-correlated information to infer the correct answer. We propose a novel end-to-end trainable model, Event-Correlated Graph Neural Networks (EC-GNNs), to perform cross-modal reasoning over information from the three modalities (i.e., caption, video, and question). Besides exploiting a new modality, we employ cross-modal reasoning modules to explicitly model inter-modal relationships and aggregate relevant information across different modalities. We propose a question-guided self-adaptive multi-modal fusion module to collect the question-oriented and event-correlated evidence through multi-step reasoning. To evaluate our proposed models, we conduct extensive experiments on VTW, MSVD-QA, and TGIF-QA datasets for Video-QA task, Toronto COCO-QA, Visual Genome-QA datasets for few-shot Image-QA task, COCO-FITB dataset for few-shot IC task, and FVQA, Visual7W + ConceptNet datasets for KB-Image-QA task. The experimental results justify these models’ effectiveness and superiority over baseline methods

    Cognitive Best Worst Method for Multiattribute Decision-Making

    Get PDF

    Human MicroRNA Oncogenes and Tumor Suppressors Show Significantly Different Biological Patterns: From Functions to Targets

    Get PDF
    MicroRNAs (miRNAs) are small noncoding RNAs which play essential roles in many important biological processes. Therefore, their dysfunction is associated with a variety of human diseases, including cancer. Increasing evidence shows that miRNAs can act as oncogenes or tumor suppressors, and although there is great interest in research into these cancer-associated miRNAs, little is known about them. In this study, we performed a comprehensive analysis of putative human miRNA oncogenes and tumor suppressors. We found that miRNA oncogenes and tumor suppressors clearly show different patterns in function, evolutionary rate, expression, chromosome distribution, molecule size, free energy, transcription factors, and targets. For example, miRNA oncogenes are located mainly in the amplified regions in human cancers, whereas miRNA tumor suppressors are located mainly in the deleted regions. miRNA oncogenes tend to cleave target mRNAs more frequently than miRNA tumor suppressors. These results indicate that these two types of cancer-associated miRNAs play different roles in cancer formation and development. Moreover, the patterns identified here can discriminate novel miRNA oncogenes and tumor suppressors with a high degree of accuracy. This study represents the first large-scale bioinformatic analysis of human miRNA oncogenes and tumor suppressors. Our findings provide help for not only understanding of miRNAs in cancer but also for the specific identification of novel miRNAs as miRNA oncogenes and tumor suppressors. In addition, the data presented in this study will be valuable for the study of both miRNAs and cancer

    Two-dimensional nitrogen and phosphorus co-doped mesoporous carbon-graphene nanosheets anode for high-performance potassium-ion capacitor

    Get PDF
    Heteroatom-doped carbon materials have high gravimetric potassium-ion storage capability because of their abundant active sites and defects. However, their practical applications toward potassium storage are limited by sluggish reaction kinetics and short cycling life owing to the large ionic radius of K+ and undesirable parasitic reactions. Herein, we report a new strategy that allows for bottom-up patterning of thin N/P co-doped carbon layers with a uniform mesoporous structure on two-dimensional graphene sheets. The highly porous architecture and N/P co-doping properties provide abundant active sites for K+, and the graphene sheets promote charge/electron transfer. This synergistic structure enables excellent K+ storage performance in terms of specific capacity (387.6 mAh g-1 at 0.05 A g-1), rate capability (over 5 A g-1), and cycling stability (70% after 3,000 cycles). As a proof of concept, a potassium-ion capacitor assembled using this carbon anode yields a high energy density of 107 Wh kg-1, a maximum power density of 18.3 kW kg-1, and ultra-long cycling stability over 40,000 cycles

    Metal Oxide Gas Sensors: Sensitivity and Influencing Factors

    Get PDF
    Conductometric semiconducting metal oxide gas sensors have been widely used and investigated in the detection of gases. Investigations have indicated that the gas sensing process is strongly related to surface reactions, so one of the important parameters of gas sensors, the sensitivity of the metal oxide based materials, will change with the factors influencing the surface reactions, such as chemical components, surface-modification and microstructures of sensing layers, temperature and humidity. In this brief review, attention will be focused on changes of sensitivity of conductometric semiconducting metal oxide gas sensors due to the five factors mentioned above

    Genome-wide association meta-analyses and fine-mapping elucidate pathways influencing albuminuria

    Get PDF
    Abstract: Increased levels of the urinary albumin-to-creatinine ratio (UACR) are associated with higher risk of kidney disease progression and cardiovascular events, but underlying mechanisms are incompletely understood. Here, we conduct trans-ethnic (n = 564,257) and European-ancestry specific meta-analyses of genome-wide association studies of UACR, including ancestry- and diabetes-specific analyses, and identify 68 UACR-associated loci. Genetic correlation analyses and risk score associations in an independent electronic medical records database (n = 192,868) reveal connections with proteinuria, hyperlipidemia, gout, and hypertension. Fine-mapping and trans-Omics analyses with gene expression in 47 tissues and plasma protein levels implicate genes potentially operating through differential expression in kidney (including TGFB1, MUC1, PRKCI, and OAF), and allow coupling of UACR associations to altered plasma OAF concentrations. Knockdown of OAF and PRKCI orthologs in Drosophila nephrocytes reduces albumin endocytosis. Silencing fly PRKCI further impairs slit diaphragm formation. These results generate a priority list of genes and pathways for translational research to reduce albuminuria

    Genome-wide association meta-analyses and fine-mapping elucidate pathways influencing albuminuria

    Get PDF
    Publisher Copyright: © 2019, The Author(s).Increased levels of the urinary albumin-to-creatinine ratio (UACR) are associated with higher risk of kidney disease progression and cardiovascular events, but underlying mechanisms are incompletely understood. Here, we conduct trans-ethnic (n = 564,257) and European-ancestry specific meta-analyses of genome-wide association studies of UACR, including ancestry- and diabetes-specific analyses, and identify 68 UACR-associated loci. Genetic correlation analyses and risk score associations in an independent electronic medical records database (n = 192,868) reveal connections with proteinuria, hyperlipidemia, gout, and hypertension. Fine-mapping and trans-Omics analyses with gene expression in 47 tissues and plasma protein levels implicate genes potentially operating through differential expression in kidney (including TGFB1, MUC1, PRKCI, and OAF), and allow coupling of UACR associations to altered plasma OAF concentrations. Knockdown of OAF and PRKCI orthologs in Drosophila nephrocytes reduces albumin endocytosis. Silencing fly PRKCI further impairs slit diaphragm formation. These results generate a priority list of genes and pathways for translational research to reduce albuminuria.Peer reviewe

    Genome-wide association meta-analyses and fine-mapping elucidate pathways influencing albuminuria

    Get PDF
    Increased levels of the urinary albumin-to-creatinine ratio (UACR) are associated with higher risk of kidney disease progression and cardiovascular events, but underlying mechanisms are incompletely understood. Here, we conduct trans-ethnic (n = 564,257) and European-ancestry specific meta-analyses of genome-wide association studies of UACR, including ancestry- and diabetes-specific analyses, and identify 68 UACR-associated loci. Genetic correlation analyses and risk score associations in an independent electronic medical records database (n = 192,868) reveal connections with proteinuria, hyperlipidemia, gout, and hypertension. Fine-mapping and trans-Omics analyses with gene expression in 47 tissues and plasma protein levels implicate genes potentially operating through differential expression in kidney (including TGFB1, MUC1, PRKCI, and OAF), and allow coupling of UACR associations to altered plasma OAF concentrations. Knockdown of OAF and PRKCI orthologs in Drosophila nephrocytes reduces albumin endocytosis. Silencing fly PRKCI further impairs slit diaphragm formation. These results generate a priority list of genes and pathways for translational research to reduce albuminuria
    • …
    corecore